130 research outputs found
Discussion of ``2004 IMS Medallion Lecture: Local Rademacher complexities and oracle inequalities in risk minimization'' by V. Koltchinskii [arXiv:0708.0083]
Estimation of matrices with row sparsity
An increasing number of applications is concerned with recovering a sparse
matrix from noisy observations. In this paper, we consider the setting where
each row of the unknown matrix is sparse. We establish minimax optimal rates of
convergence for estimating matrices with row sparsity. A major focus in the
present paper is on the derivation of lower bounds.
Learning by mirror averaging
Given a finite collection of estimators or classifiers, we study the problem
of model selection type aggregation, that is, we construct a new estimator or
classifier, called aggregate, which is nearly as good as the best among them
with respect to a given risk criterion. We define our aggregate by a simple
recursive procedure which solves an auxiliary stochastic linear programming
problem related to the original nonlinear one and constitutes a special case of
the mirror averaging algorithm. We show that the aggregate satisfies sharp
oracle inequalities under some general assumptions. The results are applied to
several problems including regression, classification and density estimation.
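For orientation, a minimal numerical sketch of an exponential-weights aggregate averaged over iterations, in the spirit of the mirror averaging recursion described above; the temperature parameter beta, the loss matrix, and all names here are illustrative assumptions, not the paper's procedure:

```python
import numpy as np

def mirror_averaging(losses, beta=1.0):
    """Aggregate M candidate estimators by averaged exponential weights.

    losses: (n, M) array; losses[t, j] is the loss incurred by
            candidate j on observation t.
    Returns a convex weight vector: the time average of the
    exponential-weights distributions built from cumulative losses.
    """
    n, M = losses.shape
    cum = np.zeros(M)                          # cumulative losses per candidate
    avg = np.zeros(M)                          # running sum of weight vectors
    for t in range(n):
        w = np.exp(-(cum - cum.min()) / beta)  # shift for numerical stability
        w /= w.sum()                           # exponential-weights distribution
        avg += w
        cum += losses[t]
    return avg / n                             # Cesàro average over iterations

# Usage: the weights concentrate on the candidate with the smallest loss.
rng = np.random.default_rng(0)
losses = rng.random((200, 3)) * np.array([1.0, 0.5, 1.5])
print(mirror_averaging(losses, beta=5.0))
```

The averaging over iterations, rather than using the final exponential-weights vector alone, is one way to read the recursive construction sketched above.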
Penalized maximum likelihood and semiparametric second-order efficiency
We consider the problem of estimation of a shift parameter of an unknown
symmetric function in Gaussian white noise. We introduce a notion of
semiparametric second-order efficiency and propose estimators that are
semiparametrically efficient and second-order efficient in our model. These
estimators are of a penalized maximum likelihood type with an appropriately
chosen penalty. We argue that second-order efficiency is crucial in
semiparametric problems since only the second-order terms in asymptotic
expansion for the risk account for the behavior of the ``nonparametric
component'' of a semiparametric procedure, and they are not dramatically
smaller than the first-order terms.
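For context, the shift model in question can be sketched as follows; the notation and the form of the penalty are assumptions here, not quoted from the paper:

```latex
% Gaussian white noise model with unknown symmetric f and shift theta:
dX(t) = f(t - \theta)\,dt + \varepsilon\,dW(t), \qquad t \in [0,1],
% and a penalized maximum likelihood estimator of the shift:
\hat{\theta} = \arg\max_{\theta} \sup_{f}
  \bigl( \ell_\varepsilon(\theta, f) - \mathrm{pen}(f) \bigr),
```

where $\ell_\varepsilon$ is the log-likelihood and $\mathrm{pen}$ a smoothness penalty whose choice governs the second-order term in the risk expansion.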
Variable selection with Hamming loss
We derive non-asymptotic bounds for the minimax risk of variable selection
under expected Hamming loss in the Gaussian mean model in $\mathbb{R}^d$ for classes of $s$-sparse vectors separated from 0 by a constant $a>0$. In some cases, we get exact expressions for the non-asymptotic minimax risk as a function of $(d,s,a)$ and find the minimax selectors explicitly. These results
are extended to dependent or non-Gaussian observations and to the problem of
crowdsourcing. Analogous conclusions are obtained for the probability of wrong
recovery of the sparsity pattern. As corollaries, we derive necessary and
sufficient conditions for such asymptotic properties as almost full recovery
and exact recovery. Moreover, we propose data-driven selectors that provide
almost full and exact recovery adaptively to the parameters of the classes.
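As a hedged illustration of the kind of selector studied here, a coordinatewise thresholding rule for the sparsity pattern; the specific threshold formula and all names below are assumptions for the example, not the paper's exact minimax selector:

```python
import numpy as np

def threshold_selector(y, s, a, sigma=1.0):
    """Select components of an s-sparse mean vector from y = theta + noise.

    Declares component j active iff |y_j| exceeds a threshold that
    balances the separation a against the sparsity level s out of d.
    The formula below is an illustrative assumption.
    """
    d = len(y)
    t = a / 2.0 + (sigma**2 / a) * np.log((d - s) / s)
    return (np.abs(y) > t).astype(int)   # estimated sparsity pattern

# Usage: Hamming loss of the selector on a synthetic s-sparse vector.
rng = np.random.default_rng(1)
d, s, a = 1000, 10, 3.0
theta = np.zeros(d); theta[:s] = a
eta_hat = threshold_selector(theta + rng.standard_normal(d), s, a)
print("Hamming loss:", int(np.sum(eta_hat != (theta != 0))))
```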
Aggregation by exponential weighting, sharp PAC-Bayesian bounds and sparsity
We study the problem of aggregation under the squared loss in the model of
regression with deterministic design. We obtain sharp PAC-Bayesian risk bounds
for aggregates defined via exponential weights, under general assumptions on
the distribution of errors and on the functions to aggregate. We then apply
these results to derive sparsity oracle inequalities.
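For concreteness, a standard form of the exponentially weighted aggregate in this setting; the prior $\pi$, temperature $\beta$, and empirical-norm notation are sketched from the usual conventions rather than quoted from the paper:

```latex
\hat{f}_{\mathrm{EW}} = \sum_{j=1}^{M} \theta_j f_j,
\qquad
\theta_j = \frac{\pi_j \exp\bigl(-\|y - f_j\|_n^2 / \beta\bigr)}
                {\sum_{k=1}^{M} \pi_k \exp\bigl(-\|y - f_k\|_n^2 / \beta\bigr)},
```

where $\|\cdot\|_n$ is the empirical norm over the $n$ design points. Sparsity oracle inequalities can then be obtained by choosing a prior $\pi$ that downweights candidates with many nonzero coefficients.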
Estimating Mutual Information
We present two classes of improved estimators for mutual information $I(X,Y)$, from samples of random points distributed according to some joint probability density $\mu(x,y)$. In contrast to conventional estimators based on binnings, they are based on entropy estimates from $k$-nearest neighbour distances. This means that they are data efficient (with $k=1$ we resolve structures down to the smallest possible scales), adaptive (the resolution is higher where data are more numerous), and have minimal bias. Indeed, the bias of the underlying entropy estimates is mainly due to non-uniformity of the density at the smallest resolved scale, giving typically systematic errors which scale as functions of $k/N$ for $N$ points. Numerically, we find that both families become exact for independent distributions, i.e. the estimator vanishes (up to statistical fluctuations) if $\mu(x,y)=\mu(x)\,\mu(y)$. This holds for all tested marginal distributions and for all dimensions of $x$ and $y$. In addition, we give estimators for redundancies
between more than 2 random variables. We compare our algorithms in detail with
existing algorithms. Finally, we demonstrate the usefulness of our estimators
for assessing the actual independence of components obtained from independent
component analysis (ICA), for improving ICA, and for estimating the reliability
of blind source separation.
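As an illustration of the nearest-neighbour idea, a minimal sketch of the first of the two estimator families (the KSG construction, $I = \psi(k) + \psi(N) - \langle \psi(n_x+1) + \psi(n_y+1) \rangle$), using max-norm distances; the function names and tolerance handling are assumptions of this sketch, not the authors' reference code:

```python
import numpy as np
from scipy.spatial import cKDTree
from scipy.special import digamma

def ksg_mutual_information(x, y, k=3):
    """First KSG estimator of I(X;Y) in nats.

    x, y: arrays of shape (N, dx) and (N, dy).
    Finds the max-norm distance to the k-th neighbour in the joint
    space, then counts marginal neighbours strictly inside that radius.
    """
    n = len(x)
    xy = np.hstack([x, y])
    d, _ = cKDTree(xy).query(xy, k=k + 1, p=np.inf)
    eps = d[:, -1]                       # k-th neighbour distance per point
    nx = cKDTree(x).query_ball_point(x, eps - 1e-12, p=np.inf,
                                     return_length=True) - 1
    ny = cKDTree(y).query_ball_point(y, eps - 1e-12, p=np.inf,
                                     return_length=True) - 1
    return (digamma(k) + digamma(n)
            - np.mean(digamma(nx + 1) + digamma(ny + 1)))

# Usage: correlated Gaussians, where I = -0.5 * log(1 - rho^2).
rng = np.random.default_rng(2)
rho = 0.8
x = rng.standard_normal((5000, 1))
y = rho * x + np.sqrt(1 - rho**2) * rng.standard_normal((5000, 1))
print(ksg_mutual_information(x, y, k=3))   # roughly 0.51 nats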
Regularization of statistical inverse problems and the Bakushinskii veto
In the deterministic context, Bakushinskii's theorem excludes the existence of purely data-driven convergent regularization methods for ill-posed problems. In the present work we prove that, in the statistical setting, one can either construct a counterexample or derive an equivalent formulation, depending on the class of probability distributions considered. Hence, Bakushinskii's
theorem does not generalize to the statistical context, although this has often
been assumed in the past. To arrive at this conclusion, we derive from the classical theory new concepts for a general study of statistical inverse problems and give a systematic clarification of the key ideas of statistical regularization.
Iteratively regularized Newton-type methods for general data misfit functionals and applications to Poisson data
We study Newton-type methods for inverse problems described by nonlinear operator equations $F(u)=g$ in Banach spaces, where the Newton equations (the successive linearizations of $F$) are regularized variationally using a general
data misfit functional and a convex regularization term. This generalizes the
well-known iteratively regularized Gauss-Newton method (IRGNM). We prove
convergence and convergence rates as the noise level tends to 0 both for an a
priori stopping rule and for a Lepskiĭ-type a posteriori stopping rule.
Our analysis includes previous order optimal convergence rate results for the
IRGNM as special cases. The main focus of this paper is on inverse problems
with Poisson data where the natural data misfit functional is given by the
Kullback-Leibler divergence. Two examples of such problems are discussed in
detail: an inverse obstacle scattering problem with amplitude data of the
far-field pattern and a phase retrieval problem. The performance of the proposed method for these problems is illustrated in numerical examples.
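For orientation, a minimal sketch of the classical IRGNM iteration that the paper generalizes, with quadratic data misfit and a Tikhonov-type penalty; the toy operator, the geometric schedule for the regularization parameters, and the fixed iteration count (an a priori stopping rule) are assumptions of this sketch, not the paper's Poisson-data setting:

```python
import numpy as np

def irgnm(F, Fprime, g_obs, u0, alpha0=1.0, q=0.5, n_iter=12):
    """Iteratively regularized Gauss-Newton method, quadratic misfit.

    Each step minimizes, over u,
        ||F(u_n) + F'(u_n)(u - u_n) - g_obs||^2 + alpha_n ||u - u0||^2
    with alpha_n = alpha0 * q**n decreasing geometrically.
    """
    u = u0.copy()
    for n in range(n_iter):
        alpha = alpha0 * q**n
        J = Fprime(u)                          # Jacobian at the iterate
        r = g_obs - F(u)                       # current data residual
        # Normal equations of the regularized, linearized problem.
        A = J.T @ J + alpha * np.eye(len(u))
        b = J.T @ r + alpha * (u0 - u)
        u = u + np.linalg.solve(A, b)
    return u

# Usage on a toy smooth nonlinear operator from R^5 to R^20.
rng = np.random.default_rng(3)
A_mat = rng.standard_normal((20, 5)) / 5.0
F = lambda u: A_mat @ (u + 0.1 * u**3)
Fprime = lambda u: A_mat @ (np.eye(5) + 0.3 * np.diag(u**2))
u_true = np.array([1.0, -0.5, 0.3, 0.8, -1.2])
g_obs = F(u_true) + 0.01 * rng.standard_normal(20)
print(np.round(irgnm(F, Fprime, g_obs, u0=np.zeros(5)), 2))
```

In the paper's setting the quadratic misfit would be replaced by a Kullback-Leibler divergence for Poisson data, and the fixed iteration count by a discrepancy-based or Lepskiĭ-type stopping rule.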
- …